Search CORE

515 research outputs found

Assessment of replicate bias in 454 pyrosequencing and a multi-purpose read-filtering tool

Author: Jérôme Mariette
Klopp Christophe
Noirot Céline
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Roche 454 pyrosequencing platform is often considered the most versatile of the Next Generation Sequencing technology platforms, permitting the sequencing of large genomes, the analysis of variations or the study of transcriptomes. A recent reported bias leads to the production of multiple reads for a unique DNA fragment in a random manner within a run. This bias has a direct impact on the quality of the measurement of the representation of the fragments using the reads. Other cleaning steps are usually performed on the reads before assembly or alignment. Findings PyroCleaner is a software module intended to clean 454 pyrosequencing reads in order to ease the assembly process. This program is a free software and is distributed under the terms of the GNU General Public License as published by the Free Software Foundation. It implements several filters using criteria such as read duplication, length, complexity, base-pair quality and number of undetermined bases. It also permits to clean flowgram files (.sff) of paired-end sequences generating on one hand validated paired-ends file and the other hand single read file. Conclusions Read cleaning has always been an important step in sequence analysis. The pyrocleaner python module is a Swiss knife dedicated to 454 reads cleaning. It includes commonly used filters as well as specialised ones such as duplicated read removal and paired-end read verification.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ProdInra

sigReannot: an oligo-set re-annotation pipeline based on similarities with the Ensembl transcripts and Unigene clusters

Author: Casel Pierrot
Klopp Christophe
Lagarrigue Sandrine
Moreews François
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Microarray is a powerful technology enabling to monitor tens of thousands of genes in a single experiment. Most microarrays are now using oligo-sets. The design of the oligo-nucleotides is time consuming and error prone. Genome wide microarray oligo-sets are designed using as large a set of transcripts as possible in order to monitor as many genes as possible. Depending on the genome sequencing state and on the assembly state the knowledge of the existing transcripts can be very different. This knowledge evolves with the different genome builds and gene builds. Once the design is done the microarrays are often used for several years. The biologists working in EADGENE expressed the need of up-to-dated annotation files for the oligo-sets they share including information about the orthologous genes of model species, the Gene Ontology, the corresponding pathways and the chromosomal location. Results The results of SigReannot on a chicken micro-array used in the EADGENE project compared to the initial annotations show that 23% of the oligo-nucleotide gene annotations were not confirmed, 2% were modified and 1% were added. The interest of this up-to-date annotation procedure is demonstrated through the analysis of real data previously published. Conclusion SigReannot uses the oligo-nucleotide design procedure criteria to validate the probe-gene link and the Ensembl transcripts as reference for annotation. It therefore produces a high quality annotation based on reference gene sets.</p

HAL-CentraleSupelec

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

INRIA a CCSD electronic archive server

PubMed Central

Edinburgh Research Explorer

ProdInra

HAL-Rennes 1

D-GENIES : Dot plot large GENomes in an interactive, efficient and simple way

Author: Christophe Klopp
Floréal Cabanettes
Publication venue: 'PeerJ'
Publication date: 01/01/2018
Field of study

Crossref

From high throughput 454 GS FLX data analysis process of 16S RNA gene sequences using barcoding to bacterial community exploration

Author: Cauquil Laurent
Combes Sylvie
Enjalbert Francis
Klopp Christophe
Mariette Jérôme
Rousseau Christine
Troegeler-Meynadier Annabelle
Zened Asma
Publication venue
Publication date: 01/01/2011
Field of study

From high throughput 454 GS FLX data analysis process of 16S RNA gene sequences using barcoding to bacterial community exploratio

Open Archive Toulouse Archive Ouverte

The ruminal level of trans-10 fatty acids of dairy cows is linked to the composition of bacterial community

Author: Cauquil Laurent
Combes Sylvie
Enjalbert Francis
Klopp Christophe
Mariette Jérôme
Rousseau C.
Troegeler-Meynadier Anabelle
Zened Asma
Publication venue
Publication date: 01/01/2011
Field of study

The ruminal level of trans-10 fatty acids of dairy cows is linked to the composition of bacterial communit

Open Archive Toulouse Archive Ouverte

ProdInra

Whole-genome sequencing of Aspergillus tubingensis G131 and overview of its secondary metabolism potential

Author: Choque Élodie
Klopp Christophe
Mathieu Florence
Raynal José
Valiere Sophie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2018
Field of study

Background : Black Aspergilli represent one of the most important fungal resources of primary and secondary metabolites for biotechnological industry. Having several black Aspergilli sequenced genomes should allow targeting the production of certain metabolites with bioactive properties. In this study, we report the draft genome of a black Aspergilli, A. tubingensis G131, isolated from a French Mediterranean vineyard. This 35 Mb genome includes 10,994 predicted genes. A genomic-based discovery identifies 80 secondary metabolites biosynthetic gene clusters. Genomic sequences of these clusters were blasted on 3 chosen black Aspergilli genomes: A. tubingensis CBS 134.48, A. niger CBS 513.88 and A. kawachii IFO 4308. This comparison highlights different levels of clusters conservation between the four strains. It also allows identifying seven unique clusters in A. tubingensis G131. Moreover, the putative secondary metabolites clusters for asperazine and naphtho-gamma-pyrones production were proposed based on this genomic analysis. Key biosynthetic genes required for the production of 2 mycotoxins, ochratoxin A and fumonisin, are absent from this draft genome. Even if intergenic sequences of these mycotoxins biosynthetic pathways are present, this could not lead to the production of those mycotoxins by A. tubingensis G131

Open Archive Toulouse Archive Ouverte

Directory of Open Access Journals

HAL-Artois

ProdInra

Whole-genome, deep pyrosequencing analysis of a duck influenza A virus evolution in swine cells.

Author: Bouchez Olivier
Bourret Vincent
Croville Guillaume
Guérin Jean-Luc
Klopp Christophe
Mariette Jérôme
Tiley Laurence
Publication venue: Infect Genet Evol
Publication date: 01/01/2013
Field of study

We studied the sub-population level evolution of a duck influenza A virus isolate during passage in swine tracheal cells. The complete genomes of the A/mallard/Netherlands/10-Nmkt/1999 strain and its swine cell-passaged descendent were analysed by 454 pyrosequencing with coverage depth ranging from several hundred to several thousand reads at any point. This allowed characterization of defined minority sub-populations of gene segments 2, 3, 4, 5, 7, and 8 present in the original isolate. These minority sub-populations ranged between 9.5% (for segment 2) and 46% (for segment 4) of their respective gene segments in the parental stock. They were likely contributed by one or more viruses circulating within the same area, at the same period and in the same or a sympatric host species. The minority sub-populations of segments 3, 4, and 5 became extinct upon viral passage in swine cells, whereas the minority sub-populations of segments 2, 7 and 8 completely replaced their majority counterparts. The swine cell-passaged virus was therefore a three-segment reassortant and also harboured point mutations in segments 3 and 4. The passaged virus was more homogenous than the parental stock, with only 17 minority single nucleotide polymorphisms present above 5% frequency across the whole genome. Though limited here to one sample, this deep sequencing approach highlights the evolutionary versatility of influenza viruses whereby they exploit their genetic diversity, predilection for mixed infection and reassortment to adapt to a new host environmental niche.This work was supported by a grant from DEFRA and HEFCE under the Veterinary Training and Research Initiative to the Cambridge Infectious Diseases Consortium (VB, LT), BBSRC grants BB/H014306/1 and BB/G00479X/1 (LT), and the French Ministry of Agriculture, INRA and the French Région Midi-Pyrénées (GC, J-LG, VB).This is the accepted version of the original version available at: http://dx.doi.org/10.1016/j.meegid.2013.04.03

HAL Descartes

Apollo (Cambridge)

Hal-Diderot